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ABSTRACT 

Structural antibody database (SAbDab; http://opig. 
stats.ox.ac.uk/webapps/sabdab) is an online 
resource containing all the publicly available 
antibody structures annotated and presented in a 
consistent fashion. The data are annotated with 
several properties including experimental informa- 
tion, gene details, correct heavy and light chain 
pairings, antigen details and, where available, 
antibody-antigen binding affinity. The user can 
select structures, according to these attributes as 
well as structural properties such as complementar- 
ity determining region loop conformation and 
variable domain orientation. Individual structures, 
datasets and the complete database can be 
downloaded. 

INTRODUCTION 

Antibodies form the foundations of the vertebrate 
immune response. These proteins form complexes with 
potentially pathogenic molecules called antigens and 
inhibit their function or recruit other components of the 
immunological machinery to destroy them. In addition to 
the biological importance of antibodies, their abihty to be 
raised against an almost limitless number of molecules has 
made them useful laboratory tools and increasingly useful 
as therapeutic agents in humans (1). This biopharniaceut- 
ical application has motivated the desire to understand 
how binding, stability and immunogenic properties of 
the antibody are determined and how they can be 
modified. 

Computational analyses and tools are increasingly 
being employed to aid the antibody engineering process 
(2). Many of these tools now use only the antibody data, 
as opposed to general protein data, because this has been 
shown to increase performance (3,4). 

The publicly available structural data for most types of 
proteins are too sparse to merit protein-specific prediction 
methods. However, since the first antibody structure was 



deposited in 1976 (5), the number of antibody structures in 
the protein data bank (PDB) (6) has grown, and it now 
represents approximately 1.75% of the total 91939 entries 
(July 2013). 

Several databases that handle antibody data currently 
exist (7-13). Of these, most are sequence-based or are 
antibody discovery tools. The most recent, DIGIT (13), 
provides sequence information for immunoglobuhns and 
has the advantage over earlier sequence databases [Kabat 
(7), IMGT (9), Vbase2 (8)] of providing heavy and hght 
chain sequence pairings. However, it does not incorporate 
structural data. AntigenDB (11) and IEDB-3D (12) do 
include structural data. However, both focus on collecting 
epitope data and do not include unbound antibody struc- 
tures. In comparison, both IMGT (9) and the Abysis 
portal (10) provide the ability to inspect and download 
individual Ijound and unbound antibody structures. 
Neither allow for the generation of bespoke datasets nor 
for the download of an ensemble of curated structural 
data. 

To address this problem, we have developed a 
Structural Antibody Database (SAbDab), a database 
devoted to automatically collecting, curating and present- 
ing antibody structural data in a consistent manner for 
both bulk analysis and individual inspection. SAbDab 
updates on a weekly basis and provides users with a 
range of methods to select sets of structures. For 
example, users can select by species, experimental details 
(e.g. method, resolution and r-factor), similarity to a given 
antibody sequence, amino-acid composition at certain 
positions and antibody-antigen affinity. Entries can also 
be selected using structural annotations including, for 
example, the canonical form of the complementarity 
determining regions (CDR) (14), orientation between the 
antibody variable domains (15) and the presence of 
constant domains in the structure. Structures can be in- 
spected individually or downloaded en masse either as the 
original file from the PDB or as a structure that has been 
annotated using the Chothia numbering scheme (16). In 
aU cases, a tab-separated file detaihng heavy and hght 
chain pairing, antibody-antigen pairing and all other an- 
notations is generated. 
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Antibody structure nomenclature 

Antibodies liave a well-defined structure that is conserved 
over majority of the molecule. They typically consist of 
four polypeptide chains, two light chains and two longer 
heavy chains (see Figure 1). Each light chain folds to form 
two domains, one variable (VL) and one constant (CL). 
Each heavy chain folds to form four or more domains, one 
variable (VH) and three or more constant domains (CHI, 
CH2 and CH3). The VL and CLl domains from one hght 
chain associate with the VH and CHI domains of a heavy 
chain to form an antigen-binding fragment (F^^). Two 
i^^^s form the arms of the Y-shaped structure of the 
antibody. The remaining constant domains on each 
heavy chain (CH2 and CHS) associate to form the stem 
of the Y and are known collectively as the crystallisable or 
constant {Fc) fragment. 

Typically, a natural antibody has two identical antigen- 
binding sites, one at the tip of each F^]^ arm. On both 
domains, VH and VL, of the variable fragment (collect- 
ively termed the Fy) are three CDRs: HI, H2 and H3 on 
VH and LI, L2 and L3 on VL. Five of the six CDRs have 
structures that can be classified into 'canonical clusters' 



(16). The remaining loop, H3, is more variable and 
cannot be treated in the same way (17). In fact, modelhng 
the H3 loop remains one of the most difficult challenges in 
antibody structure prediction (2). 

The residues in each variable domain outside the CDRs 
are referred to as the framework regions. The framework 
is relatively conserved in sequence and has a P-sandwich 
architecture. This conserved structure allows for equiva- 
lent residue positions to be annotated from antibody to 
antibody. Several numbering systems exist that are largely 
similar over the framework region but have different def- 
initions around the CDRs. Here, we primarily use the 
widely adopted Chothia numbering scheme (16), as it is 
informed by structural analysis and is defined over the 
entire variable region. 



DATA SOURCES AND CONTENTS 

Antibody structures 

As of 25 July 2013, the database contains 1624 structures 
with one or more antibody chains. Of these, 1418 have 
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Figure 1. SAbDab's workflow. Each week new structures from the PDB are analyzed to find antibody chains. These structures are then annotated 
with a number of properties and stored in SAbDab. Users may access and select this data using a number of different criteria. Structures and 
annotations can be downloaded individually or as a dataset. Inset, a schematic of the IgG antibody structure and the F, fragment formed by the 
heavy and hght variable domains, VH and VL. 
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at least one paired heavy and light chain that form a Fab- 
The remainder are largely single-domain antibodies or are 
cases when only one antibody chain has been crystallized. 
SAbDab is updated on a weekly basis using the technique 
summarized in Figure 1 and detailed below. The database 
is currently growing at a mean rate of six new structures 
per week. 

Each week, the PDB releases new experimental 
structures. Using key word searches, it is possible to 
identify most of those that contain an antibody chain. 
However, no direct or consistent information is 
given about chain type, heavy-hght chain pairings or anti- 
body-antigen chain pairings. Therefore, SAbDab 
attempts to apply the Chothia antibody numbering to 
the sequence of each new chain using ABnum (18). This 
automatically detects each chain's type — heavy, hght or 
non-antibody. The process is applied recursively to 
sequences to identify each variable region of the chain 
and thus enable the identification of single-chain i^,,s 
(scFvs) that have not been split into separate chains. 
Those non-antibody chains that belong to a PDB entry 
containing an unequal number of heavy and hght chains 
are ahgned to antibody sequence profiles using MUSCLE 

(19) . A chain must have a sequence identity of <35% to 
any antibody sequence profile for it to be considered a 
potential antigen. Those that exceed this threshold are 
flagged for manual inspection. In addition, any structure 
whose header details contain words similar to 'T-ceh' or 
'MHC are flagged for manual inspection before their 
inclusion in SAbDab. 

To pair heavy and hght chains, the constraint is applied 
that the conserved cysteine at Chothia position 92 on a 
heavy chain must be within 22 A of the conserved cysteine 
at position 88 on a light chain. Potential antigens are 
identified from the non-antibody chains and the non- 
polymer, nucleic-acid or carbohydrate molecules. Those 
smaU molecules that are recognized as common solvents 

(20) (e.g. glycerol) are discarded. Antibody chains are then 
paired with their antigen molecules by calculating the 
number of CDR residues that are within 7.5 A of each 
candidate. If there is more than one molecule that 
makes contact with the antibody CDRs, the structure is 
flagged for manual inspection. Polypeptide antigens are 
classified as proteins if they contain >50 amino 
acids and peptides otherwise. Only the bound polypeptide 
chains are reported as the antigen. Other antigens 
are either classified as carbohydrates, nucleic acids or 
haptens (non-polymeric hgands). The antibody- 
antigen complex content (July 2013) is summarized in 
Table 1. 

Annotations are obtained for the antibodies and 
antigens from a number of external sources. If the entry 
exists in the IMGT database, annotations of allele, gene, 
subgroup, group and isotype are collected. Where no 
IMGT entry exists, each antibody chain is annotated 
down to the subgroup level by ahgnment to representative 
sequences. Experimental details are collected from the 
PDB. Details about the name, molecule type and structure 
of non-peptide hgands are obtained from the hgand-expo 
database (6). 



Table 1. SAbDab's current antibody-antigen complex content 



Antigen type 


No. of structures 


No. of 
Fy regions 


Protein 


604 


1081 


Peptide 


224 


321 


Carbohydrate 


64 


86 


Nucleic-acid 


12 


15 


Hapten 


147 


224 


Total contents 


1624 


3048 



Multiple Fy regions may appear in a single PDB structure. 



Affinity data 

Antibody binding affinity data are primarily obtained 
froin two databases, PDB-Bind (21) and the structure- 
based benchmark (22). All the antibody entries were 
selected and only those with or data were kept. 
Where available, meta-data that are pertinent to affinity 
data (e.g. experimental conditions) are also collected. 

Currently, SAbDab contains 190 structures with an 
associated affinity value. In total, 133 are bound to 
proteins, 38 to peptides and 19 to hapten antigens. This 
curated data set should serve as a useful benchmarking 
resource for the antibody-antigen docking prediction 
community and the antibody engineering community. 

Complementarity determining regions 

There are inultiple characterizations of antibody CDRs 
(16,23-25). In SAbDab, the Kabat (23), Contact (24) 
and Chothia (16) CDRs are annotated. The length and 
sequence of the CDRs, according to these three defin- 
itions, is extracted for each structure and recorded in 
SAbDab. In the database, the Chothia CDRs (16) are 
further analyzed to assign membership into structural 
clusters, often referred to as canonical conformations. 

The canonical conformations of a given CDR type and 
length were originally created with the aim of linking 
sequence with structure. These groupings have been 
studied extensively (14,16,26-29). Given the exponential 
growth of the number of antibody structures in the PDB 
(Figure 2), we provide a standardized tool for studying the 
structural classes of CDRs. SAbDab regularly clusters the 
latest set of Chothia CDRs for each type (HI, H2, H3, LI, 
L2 and L3) and length. The clustering is performed by 
calculating the pairwise root mean square deviation 
between the CDRs and using a UPGMA clustering algo- 
rithm (30) at a number of cut-offs. Any correspondence to 
previously defined canonical classes is noted for each 
cluster. This feature will automatically monitor the con- 
fonnational space of the CDRs as the amount of antibody 
structural data continues to increase. 

VH-VL orientation 

The antigen binding site is formed between the variable 
domains, VH and VL, of an antibody. The topology of the 
site is therefore influenced by how the domains are 
orientated with respect to one another. Optimizing the 
VH-VL orientation has been proposed as a mechanism 
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Figure 2. The number of antibody structures in the PDB is rising 
rapidly. On an average, six new antibody structures are added to 
SAbDab each week. 



to fine tune antibody-antigen affinity. Indeed, in 
humanization experiments, affinity is found to be 
regained after making mutations that are distant from 
the antigen-binding site and therefore indicative of a struc- 
tural change, modifying the VH-VL orientation (31-33). 
In SAbDab, we use the ABangle methodology (15) that 
characterizes the orientation in an absolute sense using six 
measures, five angles and a distance. These measures allow 
for the orientation space of antibodies to be characterized. 
In SAbDab, we automatically calculate these measures for 
each Fy region in the database. 



ACCESSING THE DATA 

The data in SAbDab can be accessed and filtered in a 
number of ways. Details of particular structures can be 
retrieved and viewed or sets of entries can be selected 
and downloaded. In addition, the entire structural 
contents of SAbDab can be downloaded. 

Downloads 

For each structure, the following files may be downloaded: 

(1) The pdb structure file 

(2) A Chothia re-numbered structure file 

(3) A tab-separated summary file containing information 
about chain pairings, antigen pairing and other 
annotations about the structure gathered by 
SAbDab. 

The structure files are available in PDB format. The 
Chothia re-numbered file contains the coordinates of 
each atom in the structure. Each antibody residue is 
renumbered with the Chothia numbering scheme over 
the variable region of domains. Non-variable region 
residues are numbered sequentially. Non-antibody chains 
retain their original residue numbering. The header of 
each file contains information about the chain types, 
pairings and antigen pairings. For instance, the structure 
lahw (34) has two heavy-light chain pairs: B-A and E-D. 



These associate with protein antigen chains C and F, 
respectively. Thus, the header contains the hues: 

REMARK 5 PflIRED_HL HCHAIN=B LCHAIN=A AGCHAIN=C AGTYPE=PROTE IN 
REMARK 5 PfllREDJL HCHAIN=E LCHAIN=D AGCHAIN=F AGTYPE=PROTE IN 

The summary file is a tab-separated. tsv file containing 
information about chain pairings and details about the 
structure, for example, experimental details, antigen affin- 
ity and species. The first fine is the name of each field. 
Each following fine corresponds to a paired heavy and 
hght antibody chain and details corresponding to that 
pairing. For instance, the first six fields of the summary 
file for lahw appear as: 

pdb Hchain Lchain model antigen_chain antigen_type . . . 
lahw B A 0 C protein 

lahw E D 0 F protein 

When a user selects any set of structures, they are able 
to download the files for each structure individually 
or collectively as a dataset using the 'download all' 
function. In the latter case, a single zip file is created 
containing an archive of all the selected structures. A 
single summary file is also created for all the heavy- and 
hght-chain pairings in the selection. This file may also be 
downloaded separately without the structural data. 

Individual structure information 

An individual structure can be accessed using its PDB 
accession code (e.g. lahw). When a structure is accessed, 
the user is brought to its summary page as shown in 
Figure 3a. Here, the structure can be visualized with 
heavy chain, hght chain, antigen and CDRs annotated 
in different colours. Clicking on the structure information 
tab shows details including experimental method used to 
acquire the structure, species information, the number of 
paired heavy and light chains and, if available, the 
associated and AG values for antibody-antigen 
binding. 

Under the paired chains information tab, further details 
about each paired heavy and hght chain (F^^) can be 
found. These include: H and L chain identifiers, the 
bound state of the ¥^8, the IMGT subgroup gene anno- 
tations, the Chothia numbered sequence of each chain, 
information about each CDR and the orientation 
measures between the VH and VL domains. If present, 
details of the antigen and its sequence are provided. 

The summary page also aUows the user the fuU set of 
download options. Links are also provided to the original 
PDB entry and, if available, to the entry for the structure 
in IMGT. 



Advanced search tool 

The advanced search tool (Figure 3b) allows the user to 
select structures based on a number of attributes. 
The attributes include experimental method, resolution 
cut-off (for x-ray structures), r-factor, bound-state 
(bound or unbound), antigen type, antibody species and 
antibody light chain type (k or X). Users can also specify 
amino-acid types that must be present at Chothia pos- 
itions. Similarly, structures can be limited to those that 
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Figure 3. Selected screen-shots of SAbDab (a) The structure summary page for an entry in SAbDab. Detailed information about the structure and a 
visualization of the antibody and antigen is available, (b) The advanced search form. Structures may be selected using a number of methods. Here, 
the advanced search selects the required attributes of each structure in the selection, (c) The alignment between a query sequence and a template 
identified by the template search function, (d) The ABangle orientation search tool. Users may select Fy structures by choosing specific regions of the 
VH-VL orientation space. 
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have an associated affinity value or those that have the 
constant domains of the F^g region present. 

After clicking on the 'get structures' button, the user 
will be presented with a list of structures that satisfy 
their selection. Basic information is shown for each struc- 
ture with a link to each entry's summary page. The 'down- 
loads' section of the results page provides options to 
download the selected structures. 

Non-redundant dataset creation 

The antibody and antigen structures in the PDB are highly 
redundant in terms of sequence. For instance, 6% of the 
bound antigens in SAbDab are lysozymes. Over represent- 
ing certain types of antigens in analysis datasets may bias 
results, especially in the antibody-antigen docking field, 
where algorithms may be trained using paratope-epitope 
contacts. To overcome this problem, we provide a non- 
redundant dataset creation tool. Structures are clustered 
using cd-hit (35) based on their sequence identity with 
respect to both antibody and antigen sequences. Users 
may select sequence identity levels for the antibody and 
antigen separately and specify other constraints for the 
structures returned. 

CDR search tools 

SAbDab offers a CDR-specific search functionality. 
A user may select CDRs using similar criteria as in the 
advanced search tool ('advanced search' section). In 
addition, CDR structures can be searched with respect 
to their CDR type and length in accordance with different 
CDR definitions and their membership of structural 
clusters or canonical classes ('complimentarity 
determining regions' section). SAbDab will return a list 
of the selected CDR structures. These can be inspected 
individually or downloaded as described in the 'down- 
loads' section. The CDR search tool also allows a non- 
redundant set of CDR structures to be selected. In this 
case, only non-identical structures with respect to type, 
length and sequence are returned. For identical sequences, 
the structure with the best resolution is returned. 

Template search tool 

The template search tool allows users to identify those 
structures in SAbDab with the highest sequence identity 
to a given antibody sequence. The returned entries may 
act as good templates for use in a modelling protocol. 
Structures can be searched according to their sequence 
identity over either the heavy or hght chain or over both 
chains at once. Users may specify whether they wish to 
calculate sequence identity over the full variable region, 
only the framework regions, only the CDRs or only a 
particular CDR. An option is also provided that 
requires each template to have the same structurally 
equivalent positions as the query sequence i.e. that there 
are no insertions or deletions between the template and 
query. 

On submission, the top N templates (as specified by the 
user) are returned, ranked by their matched sequence 
identity to the query. Each structure may be inspected 
individually and the Chothia-numbered ahgnment 



between the template and the query sequence visualized 
(Figure 3c). An option is given to download all returned 
structures individually or en masse along with a multiple 
sequence alignment of the template sequences to the query 
sequence. 

ABangle search tool 

As described in the 'VH-VL orientation' section, the 
orientation between the variable domains can be 
characterized using six absolute measures. Users can 
explore the VH-VL orientation space using our ABangle 
search tool (Figure 3d). The distribution of each measure 
has been divided into discrete bins. To select structures 
with a particular orientation, a user may click on one or 
multiple (or none) bins for each of the distributions. On 
submission, each ¥y region with a VH-VL orientation 
that falls within the selected orientation range will be 
returned. Alternatively, the same criteria as in the 
'advanced search' section can be used to select structures 
and visualize where they lie in orientation space. For 
instance, if a user selected structures with a prohne (P) 
at Chothia position L44, these would show a different 
orientation preference to those with a tryptophan (W) at 
the same position (15,36,37). 

The 'select by pdb code' function allows for the selec- 
tion of a number of individual structures for comparison 
of their VH-VL orientation. One application of this tool is 
to compare the VH-VL orientation of antibodies in their 
bound and unbound form. For example, the HIV-1 
neutralizing antibody 50.1 has been crystalhzed both 
without and in complex with its peptide antigen (38). 
These structures have been cited as evidence for conform- 
ational changes in the antibody upon antigen-binding. 
Interestingly, it is the unbound form (IGGC and IGGB) 
that has an unusual orientation, while the bound from 
(IGGI) has an orientation typical of known antibody 
structures. 



CONCLUSION 

SAbDab collects, curates and presents antibody structures 
from the PDB in an consistent manner. The aim of the 
database is to provide the antibody research community 
with a tool to easily create standardized datasets for 
analysis and to monitor the rapidly increasing amount 
of available antibody structural data. Automated weekly 
updates keep the data in SAbDab up to date and ensure 
the longevity of this resource. The database is comple- 
mented by further stand-alone antibody software that 
can be found under the 'Tools' section on the SAbDab 
front page. We hope that SAbDab provides a useful 
resource for computational and experimental antibody re- 
searchers ahke. The database is entirely open-access and 
available at http://opig.stats.ox.ac.uk/webapps/sabdab. 

FUNDING 

Engineering and Physical Sciences Research Council 
(EPSRC); UCB Pharma; Roche GmbH. Funding for 
open access charge: EPSRC. 



D1146 Nucleic Acids Research, 2014, Vol. 42, Database issue 



Conflict of interest statement. None declared. 



REFERENCES 

1. Walsh,G. (2010) Biopharmaceutical benchmarks 2010. Nat. 
BiotL'chnol., 28, 917-924. 

2. Kuroda.D., Shirai,H., Jacobson,M.P. and Nakamura,H. (2012) 
Computer-aided antibody design. Protein Eiig. Des. Set., 25, 
507-522. 

3. Choi.Y. and Deane,C.M. (2011) Predicting antibody 
complementarity determining region structures without 
classification. Mol. BioSyst., 7, 3327-3334. 

4. Krawczyk.K., Baker,!., Shi,J. and Deane,C.M. (2013) Antibody 
i-Patch prediction of the antibody binding site improves rigid 
local antibody-antigen docking. Protein Eng. De.i. Set., 26, 
621-629. 

5. Epp,0., Lattman,E.E., Scliiffer,M., Huber,R. and Palm.W. (1975) 
The molecular structure of a dimer composed of the variable 
portions of the Bence-Jones protein REI refined at 2.0-A 
resolution. Biocitemi.strv, 14, 4943^952. 

6. Berman.H.M., Westbrook,J., Feng.Z., Gilliland,G., Bhat,T.N., 
Weissig.H., Shindyalov,I.N. and Bourne.P.E. (2000) The Protein 
Data Bank. Nucleic Acids Res.. 28, 235-242. 

7. Johnson.G. and Wu.T.T. (2001) Kabat Database and its 
applications: future directions. Nucleic Acids Res.. 29, 205-206. 

8. Retter,!., Althaus,H.H., Munch,R. and Miiller.W. (2005) 
VBASE2, an integrative V gene database. Nucleic Acids Res., 33, 
D671-D674. 

9. Lefranc.M.P., Giudicelh.V., Ginestoux,C., Jabado-Michaloud,J., 
Folch,G., Bellahcene,F., Wu,Y., Gemrot,E., Brochet,X., Lane,J. 
et al. (2009) IMGT, the international ImMunoGeneTics 
information system. Nucleic Acids Res., 37, D1006-D1012. 

10. Martin.A.C.R. (2010) Antibody Engineering, Vol. 2. Protein Sc. 
Springer, Berhn Heidelberg, pp. 33-51. 

11. Ansari,H.R., Flower.D.R. and Raghava.G.P.S. (2010) AntigenDB: 
an immunoinformatics database of pathogen antigens. Nucleic 
Acids Res., 38, D847-D853. 

12. Ponomarenko.J., Papangelopoulos,N., Zajonc,D.M., Peters,B., 
Sette,A. and Bourne,P.E. (2011) IEDB-3D: structural data within 
the immune epitope database. Nucleic Acids Res., 39, 
D1164-D1170. 

13. Chailyan,A., Tramontano,A. and Marcatili.P. (2012) A database 
of immunoglobulins with integrated tools: DIGIT. Nucleic Acids 
Res., 40, D1230-D1234. 

14. Chothia.C, Lesk,A., Tramontano,A., Levitt,M., Smith-Gill, S. J., 
Air,G., Sheriff.S., Padlan,E.A., Davies,D., Tulip,W.R. el al. 
(1989) Conformations of immunoglobuhn hypervariable regions. 
Nature, 342, 877-883. 

15. Dunbar,!., Fuchs,A., Shi,J. and Deane.C.M. (2013) ABangle: 
characterising the VH-VL orientation in antibodies. Protein Eng. 
Des. Sel, 26, 611-620. 

16. Chothia.C. and Lesk.A.M. (1987) Canonical structures for the 
hypervariable regions of immunoglobulins. /. Mol. Biol.. 196, 
901-917. 

17. Morea.V., Tramontano,A., Rustici,M., Chothia.C, Lesk,A.M., 
Angeletti,M.P. and Roma.P. (1998) Conformations of the third 
hypervariable region in the VH domain of immunoglobulins. 

J. Mol. Biol., 275, 269-294. 

18. Abhinandan.K.R. and Martin.A.C.R. (2008) Analysis and 
improvements to Kabat and structurally correct numbering of 
antibody variable domains. Mol. Immunol., 45, 3832-3839. 

19. Edgar,R.C. (2004) MUSCLE: multiple sequence alignment with 
high accuracy and high throughput. Nucleic Acids Res., 32, 
1792-1797. 



20. Weichenberger,C.X., Pozharski.E. and Rupp,B. (2013) Visualizing 
ligand molecules in twilight electron density. Acta Crystallogr. 
Sect. F Struct. Biol. Cryst. Commun., 69, 195-200. 

21. Wang.R., Fang.X., Lu,Y., Yang,C.Y. and Wang,S. (2005) The 
PDBbind database: methodologies and updates. /. Med. Chem., 
48, 4111^119. 

22. Kastritis,P.L., Moal.I.H., Hwang,H., Weng,Z., Bates,P.A., 
Bonvin,A.M.J.J. and Janin,J. (2011) A structure-based benchmark 
for protein-protein binding affinity. Protein Sci., 20, 482-491. 

23. Wu.T. and Kabat.E. (1970) An analysis of the sequences of the 
variable regions of Bence Jones proteins and myeloma light 
chains and their implications for antibody complementarity. J. 
Exp. Med., 132, 211-250. 

24. MacCallum.R.M., Martin,A.C.R. and Tliornton.J.M. (1996) 
Antibody-antigen interactions: contact analysis and binding site 
topography. J. Mol. Biol, 262, 732-745. 

25. Lefranc.M.P., Pommie.C, Ruiz,M., Giudicelli,V., Foulquier,E., 
Truong,L., Thouvenin-Contet,V. and Lefranc,G. (2003) IMGT 
unique numbering for immunoglobulin and T cell receptor 
variable domains and Ig superfamily V-like domains. Dev. Comp. 
Immunol., 27, 55-77. 

26. Al-Lazikani,B., Lesk,A. and Chothia,C. (1997) Standard 
conformations for the canonical structures of immunoglobulins. 
/. Mol. Biol.. 273, 927-948. 

27. North,B., Lehmann.A. and Dunbrack,R.L. (2011) A new 
clustering of antibody CDR loop conformations. /. Mol. Biol., 
406, 228-256. 

28. Lara-Ochoa,F., Almagro,J.C., Vargas-Madrazo,E. and Conrad.M. 
(1996) Antibody-antigen recognition: a canonical structure 
paradigm. /. Mol. Evol., 43, 678-84. 

29. Martin.A.C.R. and Thornton.J.M. (1996) Structural families in 
loops of homologous proteins: automatic classification, modelling 
and application to antibodies. /. Mol. Biol., 263, 800-815. 

30. Maechler.M., Rousseeuw,P., Struyf,A., Hubert,M. and Hornik.K. 
(2013) Cluster: cluster analysis basics and extensions, R package 
version 1.14.4. 

31. Riechmann.L., Clark.M., Waldmann,H. and Winter,G. (1988) 
Reshaping human antibodies for therapy. Nature, 332, 323-327. 

32. Foote.J. and Winter.G. (1992) Antibody framework residues 
affecting the conformation of the hypervariable loops. /. Mol. 
Biol., 224, 487^99. 

33. Banfield.M.J., King.D.J., Mountain,A. and Brady.R.L. (1997) 
VL:VH domain rotations in engineered antibodies: crystal 
structures of the Fab fragments from two murine antitumor 
antibodies and their engineered human constructs. Proteins, 29, 
161-171. 

34. Huang,M., Syed,R., Stura,E.A., Stone,M.J.. Stefanko,R.S., 
Ruf,W., Edgington,T.S. and Wilson.I.A. (1998) The mechanism of 
an inhibitory antibody on TF-initiated blood coagulation revealed 
by the crystal structures of human tissue factor. Fab 5G9 and 
TF.G9 complex. /. Mol. Biol., 275, 873-894. 

35. Li,W. and Godzik.A. (2006) Cd-liit: a fast program for clustering 
and comparing large sets of protein or nucleotide sequences. 
Bioinforinatics, 22, 1658-1659. 

36. Chailyan,A., Marcatih.P. and Tramontano.A. (2011) The 
association of heavy and light chain variable domains in 
antibodies: implications for antigen specificity. FEBS J., 278, 
2858-2866. 

37. Abhinandan,K.R. and Martin.A.C.R. (2010) Analysis and 
prediction of VH/VL packing in antibodies. Protein Eng., Des. 
Sel., 23, 689-697. 

38. Stanfield.R., Takimoto-Kamimura,M., Rini,J., Profy,A. and 
Wilson.I. (1993) Major antigen-induced domain rearrangements in 
an antibody. Structure. 1, 83-93. 



